NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Efficiently Constructing Sparse Navigable Graphs

Conway, Alex; Dhulipala, Laxman; Farach-Colton, Martin; Johnson, Rob; Landrum, Ben; Musco, Christopher; Shechter, Yarin; Suel, Torsten; Wen, Richard (January 2026, ACM-SIAM Symposium on Discrete Algorithms (SODA).)

Full Text Available
Mitigating False Positives in Filters: To Adapt or to Cache?

https://doi.org/10.1145/3786324

Mo, Tianchi; Bender, Michael A; Das, Rathish; Farach-Colton, Martin; Tench, David (December 2025, ACM Transactions on Database Systems)

Recent work has investigated adaptive filters, which are filters that change their internal representation in response to queries that yield false positives. These include: (1) strongly adaptive filters, which guarantee a false-positive probability of at most ϵ for any query regardless of the history of prior queries, i.e., against adaptive adversaries, (2) support-optimal filters, which guarantee an average false-positive probability of at most ϵ over sufficiently large query sequences, when the adversary is oblivious, (3) other adaptive filters that change their representation and empirically perform better, but do not come with any specific provable guarantees beyond static filters. In this paper, we investigate the performance advantages that strongly adaptive filters offer on (non-adversarial) skewed query distributions, which are common in database applications. In our theoretical and experimental results, we model query distribution skewness with the Zipfian distribution with parameterz. We consider two strongly adaptive filters: the broom filter and the telescoping adaptive filter (TAF). We also consider two adaptive (but not strongly adaptive) filters: the adaptive cuckoo filter (ACF), and a non-adaptive rank-and-select quotient filter augmented with a cache of recent false positives, which we call the cache-augmented filter (CAF). We prove upper bounds on the false-positive rates of the broom filter, the TAF, and the CAF as a function of the Zipfian parameterzas the length of the query sequence tends to infinity. We provide an implementation of the broom filter, based on the (non-adaptive) rank-and-select quotient filter. We validate the above bounds experimentally on synthetic Zipfian query sequences on the broom filter, the TAF, and the CAF. Finally, we measure the observed false-positive rate of the broom filter, the TAF, the CAF, and the ACF on highly skewed real-world network trace data. We find that all adaptive filters achieved 1-2 orders of magnitude lower false-positive rates than non-adaptive filters. We further find that the broom filter and the TAF outperform the CAF only when the ratio of distinct negative queries to positive set size is high; otherwise, the CAF and the strongly adaptive filters yield similar false-positive rates.
more » « less
Full Text Available
Exploring the Landscape of Distributed Graph Sketching

Tench, David; West, Evan; Zhang, Kenny; Bender, Michael A; Delayo, Daniel; Farach-Colton, Martin; Gill, Gilvir; Seip, Tyler; Zhang, Victor (January 2025, SIAM)

Full Text Available
Don't Melt Your Cache: Low-Associativity with Heat-Sink

https://doi.org/10.1145/3694906.3743303

Bender, Michael A; Conway, Alex; DeLayo, Daniel; Farach-Colton, Martin; Han, Jaehyun; He, Linfeng; Johnson, Rob; Kannan, Sudarsun; Kuszmaul, William; Porter, Donald; et al (July 2025, ACM)

Full Text Available
A Nearly Quadratic Improvement for Memory Reallocation

https://doi.org/10.1145/3626183.3659965

Farach-Colton, Martin; Kuszmaul, William; Sheffield, Nathan S; Westover, Alek (June 2024, ACM)

Full Text Available
History-Independent Dynamic Partitioning: Achieving Operation-Order Privacy in Ordered Data Structures

Bender, Michael A; Farach-Colton, Martin; Goodrich, Michael T; Komlos, Hanna (June 2024, 47th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems (PODS))
History-Independent Dynamic Partitioning: Achieving Operation-Order Privacy in Ordered Data Structures

Bender, MICHAEL A.; FARACH-COLTON, Martin; GOODRICH, Michael T.; KOMLOS, Hanna (June 2024, PODS)

Full Text Available
History-Independent Concurrent Objects

https://doi.org/10.1145/3662158.3662814

Attiya, Hagit; Bender, Michael A; Farach-Colton, Martin; Oshman, Rotem; Schiller, Noa (June 2024, 43rd ACM Symposium on Principles of Distributed Computing (PODC))

Full Text Available
Layered List Labeling

https://doi.org/10.1145/3651602

Bender, Michael A.; Conway, Alex; Farach-Colton, Martin; Komlos, Hanna; Kuszmaul, William (June 2024, PODS)

Full Text Available
Adaptive Quotient Filters

https://doi.org/10.1145/3677128

Wen, Richard; McCoy, Hunter; Tench, David; Tagliavini, Guido; Bender, Michael A; Conway, Alex; Farach-Colton, Martin; Johnson, Rob; Pandey, Prashant (October 2024, Proceedings of the ACM on Management of Data)

Filters trade off accuracy for space and occasionally return false positive matches with a bounded error. Numerous systems use filters in fast memory to avoid performing expensive I/Os to slow storage. A fundamental limitation in traditional filters is that they do not change their representation upon seeing a false positive match. Therefore, the maximum false positive rate is only guaranteed for a single query, not for an arbitrary set of queries. We can improve the filter's performance on a stream of queries, especially on a skewed distribution, if we can adapt after encountering false positives. Adaptive filters, such as telescoping quotient filters and adaptive cuckoo filters, update their representation upon detecting a false positive to avoid repeating the same error in the future. Adaptive filters require an auxiliary structure, typically much larger than the main filter and often residing on slow storage, to facilitate adaptation. However, existing adaptive filters are not practical and have not been adopted in real-world systems for two main reasons. First, they offer weak adaptivity guarantees, meaning that fixing a new false positive can cause a previously fixed false positive to come back. Secondly, the sub-optimal design of the auxiliary structure results in adaptivity overheads so substantial that they can actually diminish overall system performance compared to a traditional filter. In this paper, we design and implement the \sysname, the first practical adaptive filter with minimal adaptivity overhead and strong adaptivity guarantees, which means that the performance and false-positive guarantees continue to hold even for adversarial workloads. The \sysname is based on the state-of-the-art quotient filter design and preserves all the critical features of the quotient filter such as cache efficiency and mergeability. Furthermore, we employ a new auxiliary structure design which results in considerably low adaptivity overhead and makes the \sysname practical in real systems. We evaluate the \sysname by using it to filter queries to an on-disk B-tree database and find no negative impact on insert or query performance compared to traditional filters. Against adversarial workloads, the \sysname preserves system performance, whereas traditional filters incur 2× slowdown from adversaries representing as low as 1% of the workload. Finally, we show that on skewed query workloads, the \sysname can reduce the false-positive rate 100× using negligible (1/1000th of a bit per item) space overhead.
more » « less
Full Text Available

« Prev Next »

Search for: All records